Abstract


Severe acute respiratory syndrome coronavirus (SARS-CoV) 3C-like protease (3CLpro) mediates extensive
proteolytic processing of replicase polyproteins, and is considered a promising target for anti-SARS
drug development. Here we present a rapid and high-throughput screening method to study the substrate
specificity of SARS-CoV 3CLpro. Six target amino acid positions flanking the SARS-CoV 3CLpro
cleavage site were investigated. Each batch of mixed peptide substrates with defined amino acid
substitutions at the target amino acid position was synthesized via the “cartridge replacement” approach
and was subjected to enzymatic cleavage by recombinant SARS-CoV 3CLpro. Susceptibility of each
peptide substrate to SARS-CoV 3CLpro cleavage was monitored simultaneously by matrix-assisted laser
desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). The hydrophobic pocket in
the P2 position at the protease cleavage site is crucial to SARS-CoV 3CLpro-specific binding, which is
limited to substitution by hydrophobic residues. The binding interface of SARS-CoV 3CLpro that faces
the P1’ position is suggested to be occupied by acidic amino acids; thus, the P1’ position is intolerant to
acidic residue substitution, owing to electrostatic repulsion. Steric hindrance caused by some bulky or
β-branching amino acids in the P3 and P2’ positions may also hinder the binding of SARS-CoV 3CLpro. This
study generates a comprehensive overview of SARS-CoV 3CLpro substrate specificity, which serves as
the design basis of synthetic peptide-based SARS-CoV 3CLpro inhibitors. Our experimental approach is
believed to be widely applicable for investigating the substrate specificity of other proteases in a rapid and
high-throughput manner that is compatible with future automated analysis.
Severe acute respiratory syndrome (SARS) is a life-threatening, infectious respiratory illness
characterized by high fever, chills, rigor, myalgia, dry cough, and headache, which may further
progress to a dramatic increase in lung opacity, dyspnea, and hypoxemia (Drosten et al. 2003;
Lee et al. 2003). The etiological agent of SARS was identified as a novel type of human
coronavirus, SARS-CoV (Drosten et al. 2003; Ksiazek et al. 2003; Peiris et al. 2003).
Coronaviruses are large, enveloped, single-stranded positive-sense RNA viruses that replicate in
the cytoplasm of the infected host cell (Siddell et al. 1983). Phylogenetic studies reveal that
SARS-CoV does not belong to any of the three known groups of the Coronaviridae family; thus, it
is suggested that SARS-CoV defines a distinct fourth group of coronaviruses (Marra et al. 2003;
Rota et al. 2003). The RNA genome of SARS-CoV is ~30 kb in length, with five major open reading
frames (ORFs) encoding the replicase polyprotein, the spike (S) protein, the envelope (E)
protein, the membrane (M) protein, and the nucleocapsid (N) protein (Marra et al. 2003; Rota et
al. 2003).

Investigations of various SARS-CoV proteins can facilitate the development of vaccines and drugs
against SARS. The pivotal role of the SARS-CoV surface S protein in mediating viral entry sheds
light on the generation of S protein-specific neutralizing antibodies (Gallagher and Buchmeier
2001; Choy et al. 2004), and some of the experimental vaccines have recently been tested in the
murine viral replication model (Bisht et al. 2004; Yang et al. 2004). Meanwhile, the 33.8-kDa
SARS-CoV main protease, also known as the SARS-CoV 3C-like protease (SARS-CoV 3CLpro), is being
considered extensively as an attractive and promising target for anti-SARS drug design, owing to
its important biological role in the viral life cycle (Anand et al. 2003).

SARS-CoV 3CLpro mediates extensive proteolytic processing of two overlapping replicase
polyproteins, pp1a (486 kDa) and pp1ab (790 kDa), to yield the corresponding functional
polypeptides that are essential for SARS-CoV replication and transcription processes (Herold et
al. 1993; Ziebuhr et al. 1995; Rota et al. 2003). There are at least 11 conserved SARS-CoV 3CLpro
cleavage sites on the SARS-CoV polyprotein pp1ab, in which the substrate specificity
preferentially involves the Leu-Gln↓ sequence in the P2 and P1 positions at the cleavage site,
respectively (Ziebuhr et al. 2000; Hegyi and Ziebuhr 2002; Anand et al. 2003). SARS-CoV 3CLpro
comprises three domains, including the catalytic domains I and II that form the N-terminal
chymotrypsin fold, and the unique extra helical domain III at the C terminus that is important
for regulating the activity and specificity of SARS-CoV 3CLpro (Shi et al. 2004). The reported
X-ray crystal structures of SARS-CoV 3CLpro allow the investigation of possible interactions of
SARS-CoV 3CLpro with its substrates, which contributes to the development of specific protease
inhibitors as potential anti-SARS drugs (Anand et al. 2003; Yang et al. 2003).

A better understanding of the specific enzyme substrate-binding mechanism during the SARS-CoV
3CLpro proteolytic cleavage process is required for the rational design of an effective protease
inhibitor with strong binding affinity to SARS-CoV 3CLpro. To screen the substrate specificity of
SARS-CoV 3CLpro in a rapid and high-throughput manner, in contrast to the traditional tedious
procedures, we applied matrix-assisted laser desorption/ionization time-of-flight mass
spectrometric (MALDI-TOF MS) analysis in combination with the novel “cartridge replacement”
solid-phase peptide synthesis approach to investigate the biological significance of amino acid
residues in the P2, P3, P4, P1’, P2’, and P3’ positions that flank the conserved Gln residue in
the P1 position at the SARS-CoV 3CLpro cleavage site (Schechter and Berger 1967; Fan et al. 2004,
2005). Owing to the sensitivity of MALDI-TOF MS, the protease cleavage progress of a mixture of
different peptide substrates can be monitored simultaneously to generate comprehensive data on
the substrate specificity of SARS-CoV 3CLpro. Therefore, all 20 possible amino acid substitutions
in a particular position of the peptide substrate can be easily investigated in the same
reaction. Our study can provide insights into the molecular mechanism of the SARS-CoV 3CLpro
cleavage process and reveal the feasibility of developing synthetic peptide-based protease
inhibitors as potential drugs against SARS-CoV and other coronavirus infections. Furthermore, the
rapid approach we describe in this study could be widely applicable for monitoring the cleavage
activities of other proteases, such as allowing the rapid screening of potential protease
inhibitors of the human immunodeficiency virus (HIV) as the basis of drug development. In
addition, our experimental procedures are highly compatible with automation to cope with the
increasing demand for high-throughput analysis in proteomic research.


Results
Substrate specificity preferences of SARS-CoV 3CLpro


Positive control of the SARS-CoV 3CLpro cleavage assay

The PS01 peptide was amidated and acetylated at the C and N termini, respectively. The amino acid
sequence of PS01 was based on one of the SARS-CoV 3CLpro cleavage sites on the SARS-CoV BJ01
polyprotein pp1ab (residues 3232-3247) and was used as the positive control in the enzymatic
cleavage assay to assess the activity of the recombinant SARS-CoV 3CLpro. The original PS01
peptide peak with m/z value of 1739.03 before the enzymatic assay was absent from the mass
spectrum after the cleavage reaction (Fig. 1) because it had been cleaved into two peptide
fragments with their corresponding m/z values (data not shown), which validated the enzymatic
activity of the recombinant SARS-CoV 3CLpro that we prepared.
Figure 1. Primary screening: Mass spectrometry analysis of SARS-CoV 3CLpro cleavage products. (A) Positive control. PS01
peptide peak (*) was resolved before cleavage assay (A-1) and was absent from the mass spectrum after the reaction (A-2). (B) P2
position. For PS02 peptide substrates, all 20 peptide peaks were resolved on the mass spectrum before cleavage assay (B-1), and
only the peptide peaks corresponding to Leu (*) and Phe (**) substitutions in the P2 position were absent, whereas all other peaks
remained after the cleavage reaction (B-2). (C) P3 position. For PS03 peptide substrates, all 20 peptide peaks were resolved on the
mass spectrum before cleavage assay (C-1), and only the peptide peak with Pro (*) substitution in the P3 position remained after
the reaction (C-2). (D) P4 position. For PS04 peptide substrates, all 20 peptide peaks were resolved on the mass spectrum before
cleavage assay (D-1), which were all absent from the mass spectrum after the reaction (D-2). (E) P1’ position. For PS05 peptide
substrates, all 20 peptide peaks were resolved on the mass spectrum before cleavage assay (E-1), and the peptide peaks with Pro (*),
Asp (*), and Glu (**) in the P1’ position remained after the reaction (E-2). (F) P2’ position. For PS06 peptide substrates, all 20
peptide peaks were resolved on the mass spectrum before cleavage assay (F-1), and the peptide peaks with Pro (*) and Ile/Leu (**)
in the P2’ position remained after the reaction (F-2). (G) P3’ position. For PS07 peptide substrates, all 20 peptide peaks were
resolved on the mass spectrum before cleavage assay (G-1), which were all absent from the mass spectrum after the reaction (G-2).


“Cartridge replacement” strategy 


In this study, we used MALDI-TOF MS analysis in combination with the solid-phase peptide
synthesis approach to examine the biological significance of amino acid residues in a total of
six target positions at the SARS-CoV 3CLpro cleavage sites, including the P2, P3, and P4
positions at the amino side of the P1 position, and the P1’, P2’, and P3’ positions at the
carboxyl side of the P1 position (Table 1).

Based on the flexibility in changing the contents of each amino acid cartridge, we designed a
novel peptide synthesis strategy that we named “cartridge replacement” solid-phase peptide
synthesis. Our “cartridge replacement” strategy consists of two screening procedures: (1) primary
screening and (2) secondary screening.


1. Primary screening using the “cartridge replacement” strategy



By using the “cartridge replacement” strategy, the corresponding amino acid cartridge at the
target position under investigation was deliberately replaced by another cartridge containing a
mixture of 20 standard amino acids in equal molar ratios during the solid-phase peptide synthesis
process so as to generate a mixed pool of 20 different peptide substrates with a single amino
acid residue alteration at the target position. Each batch of the mixed peptide substrates was
subjected to an in vitro enzymatic cleavage assay by recombinant SARS-CoV 3CLpro in a single
reaction and then analyzed by MALDI-TOF MS to detect the susceptibility of each of the 20
peptides to cleavage by SARS-CoV 3CLpro. The results from the primary screening of the in vitro
enzymatic cleavage assay of the six batches of synthetic peptide substrates (namely PS02, PS03,
PS04, PS05, PS06, and PS07) with amino acid substitutions in their corresponding target positions
are shown in Figure 1 and summarized in Table 2. Before the enzymatic assay, the corresponding
peaks of the 20 peptides in each substrate pool were resolved on the mass spectrum with their
exact m/z values, which demonstrated the validity of solid-phase peptide synthesis using the
“cartridge replacement” approach to generate a pool of 20 peptides with a single amino acid
difference at the target position. The results also demonstrated the capability of MALDI-TOF MS
to discriminate most of the peptide species with a single amino acid difference.


Table 1. Amino acid sequence of the seven synthetic peptide substrates

                    Sequence (a)
Peptide substrate   P9  P8  P7  P6  P5  P4  P3  P2  P1  P1’ P2’ P3’ P4’ P5’ P6’ P7’
PS01 (b)            T   S   I   T   S   A   V   L   Q   S   G   F   R   K   M   A
PS02                T   S   I   T   S   A   V   X   Q   S   G   F   R   K   M   A
PS03                T   S   I   T   S   A   X   L   Q   S   G   F   R   K   M   A
PS04                T   S   I   T   S   X   V   L   Q   S   G   F   R   K   M   A
PS05                T   S   I   T   S   A   V   L   Q   X   G   F   R   K   M   A
PS06                T   S   I   T   S   A   V   L   Q   S   X   F   R   K   M   A
PS07                T   S   I   T   S   A   V   L   Q   S   G   X   R   K   M   A

(a) The amino acids flanking the SARS-CoV 3CLpro cleavage sites are labeled from the amino terminus to the
carboxyl terminus as follows: -P3-P2-P1 ↓ P1’-P2’-P3’, as described previously (Schechter and Berger 1967).
(b) The control peptide substrate is based on the amino acid sequence of one of the SARS-CoV 3CLpro
cleavage sites on the SARS-CoV BJ01 polyprotein pp1ab (residues 3232-3247).
(c) X: the corresponding amino acid at this position is substituted by a mixture of 20 standard amino acids in
equal molar ratios.
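
As a rough illustration of how a primary-screening pool can be enumerated, the sketch below substitutes each of the 20 standard residues at one Schechter-Berger position of the PS01 sequence and estimates the expected m/z of each peptide by shifting the reported PS01 peak (1739.03) by the difference in residue mass. The residue masses are standard monoisotopic values; the function name `substitution_pool` and the shifting shortcut are illustrative assumptions, not part of the published procedure.

```python
# Standard monoisotopic residue masses (Da) of the 20 amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}

PS01 = "TSITSAVLQSGFRKMA"   # control sequence, P9 ... P7' (Table 1)
PS01_MZ = 1739.03           # m/z of the PS01 peak reported in the text

# Schechter-Berger labels for the 16 positions, N terminus to C terminus.
LABELS = [f"P{i}" for i in range(9, 0, -1)] + [f"P{i}'" for i in range(1, 8)]


def substitution_pool(target):
    """Return {residue: (sequence, expected m/z)} for all 20 substitutions at `target`.

    Hypothetical helper: the expected m/z is the reported PS01 value shifted
    by the residue-mass difference between the new and the original residue.
    """
    idx = LABELS.index(target)
    original = PS01[idx]
    pool = {}
    for res, mass in RESIDUE_MASS.items():
        seq = PS01[:idx] + res + PS01[idx + 1:]
        pool[res] = (seq, round(PS01_MZ + mass - RESIDUE_MASS[original], 2))
    return pool


p2_pool = substitution_pool("P2")
p3_pool = substitution_pool("P3")
print(p2_pool["F"])   # Phe for Leu at P2
print(p3_pool["P"])   # Pro for Val at P3
```

Under these assumptions the Phe-for-Leu and Pro-for-Val peptides come out at 1773.01 and 1737.01, matching the values quoted for PS02 and PS03 in the primary screening.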


P2, P3, and P4 positions at the amino side
of the SARS-CoV 3CLpro cleavage site

For PS02 peptide substrates with amino acid substitutions in the P2 position, only the original
peptide (m/z value of 1739.03) and the peptide with Phe substitution for Leu in the P2 position
(m/z value of 1773.01) were cleaved, with their corresponding peptide peaks absent from the mass
spectrum after the enzymatic assay, whereas all other peptides were not cleaved, with their
corresponding peaks remaining. For PS03 peptide substrates with amino acid substitutions in the
P3 position, only the peptide with Pro substitution for Val in the P3 position (m/z value of
1737.01) was not cleaved, with its corresponding peptide peak remaining in the mass spectrum
after the enzymatic assay, whereas all other peptides were cleaved successfully. For PS04 peptide
substrates with amino acid substitutions in the P4 position, all of the 20 peptides were cleaved
completely, with their corresponding peaks absent from the mass spectrum after the enzymatic
assay.


Table 2. Summary of results from primary screening

                                                  Amino acid substitution (c)
Peptide     Target         Corresponding
substrate   position (a)   amino acid (b)    Cleaved peptides   Uncleaved peptides
PS02        P2             Leu               Leu, Phe           Others (d)
PS03        P3             Val               Others (e)         Pro
PS04        P4             Ala               All (f)            -
PS05        P1’            Ser               Others (e)         Pro, Asp, Glu
PS06        P2’            Gly               Others (e)         Pro, Ile/Leu
PS07        P3’            Phe               All (f)            -

(a) The positions flanking the SARS-CoV 3CLpro cleavage sites are labeled from the N terminus to the C
terminus as follows: -P3-P2-P1 ↓ P1’-P2’-P3’, as described previously (Schechter and Berger 1967).
(b) The original amino acid residue in the position under investigation before the substitution with 20
standard amino acids.
(c) The identity of the amino acid in the particular peptide that substituted the residue at the position under
investigation.
(d) All peptides other than those in the opposite column were not cleaved by SARS-CoV 3CLpro.
(e) All peptides other than those in the opposite column were cleaved by SARS-CoV 3CLpro.
(f) All 20 peptides were cleaved by SARS-CoV 3CLpro.


P1’, P2’, and P3’ positions at the carboxyl side
of the SARS-CoV 3CLpro cleavage site


For PS05 peptide substrates, only three peptides with Pro substitution (m/z value of 1749.05),
Asp substitution (m/z value of 1767.02), and Glu substitution (m/z value of 1781.04) for Ser in
the P1’ position, respectively, were not cleaved. For PS06 peptide substrates, only two peptides
with Pro substitution (m/z value of 1779.06) and Ile/Leu substitution (m/z value of 1795.09) for
Gly in the P2’ position, respectively, were not cleaved. For PS07 peptide substrates with amino
acid substitutions in the P3’ position, all of the 20 peptides were cleaved completely, with
their corresponding peaks absent from the mass spectrum after the enzymatic assay.
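
The readout logic used throughout the primary screening (a peptide counts as cleaved when its peak disappears after the reaction) can be sketched as follows. The example uses only the PS05 m/z values quoted above, so it covers four of the 20 peptides in that pool; the function name `classify` and the 0.05 matching tolerance are illustrative assumptions rather than parameters reported in the study.

```python
def classify(expected_mz, peaks_after, tol=0.05):
    """Label each peptide 'uncleaved' if its peak is still present after the
    reaction (within `tol` m/z units, an assumed tolerance), else 'cleaved'."""
    result = {}
    for name, mz in expected_mz.items():
        still_present = any(abs(mz - peak) <= tol for peak in peaks_after)
        result[name] = "uncleaved" if still_present else "cleaved"
    return result


# PS05 (P1' position): residue at P1' -> m/z value quoted in the text.
expected = {"Ser": 1739.03, "Pro": 1749.05, "Asp": 1767.02, "Glu": 1781.04}
# Peaks reported as remaining after the cleavage reaction.
remaining = [1749.05, 1767.02, 1781.04]

calls = classify(expected, remaining)
for residue, call in calls.items():
    print(f"P1' {residue}: {call}")
```

With these inputs the original Ser peptide is called cleaved and the Pro, Asp, and Glu peptides are called uncleaved, in line with the PS05 row of Table 2.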


2. Secondary screening using the “cartridge 
replacement” strategy 


After the primary screening procedure, we generated the comprehensive cleavage profiles of the
corresponding 20 peptide substrates of each target amino acid position simultaneously, based on
the “cartridge replacement” strategy. However, for peptides carrying ambiguous residues with
identical or close masses (such as Ile/Leu and Gln/Lys), true cleavage might not be identified
with absolute confidence. A secondary screening procedure was therefore necessary: we synthesized
the target peptide substrates in defined batches, particularly those peptides with identical or
close molecular masses, which had resulted in ambiguous peaks on the mass spectra in the primary
screening procedure.
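
One way to think about the secondary screening is as a partitioning problem: substitutions whose masses lie too close together must not share a batch. The sketch below flags such pairs and splits the 20 residues greedily into batches. It is only an illustration under stated assumptions: the residue masses are standard monoisotopic values, while the tolerance, the greedy rule, and the function names `close_pairs` and `split_into_batches` are hypothetical and do not reproduce the batch composition actually used (Table 3).

```python
from itertools import combinations

# Standard monoisotopic residue masses (Da) of the 20 amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def close_pairs(masses, tol):
    """Pairs of residues whose masses differ by at most `tol` Da."""
    return [(a, b) for a, b in combinations(sorted(masses), 2)
            if abs(masses[a] - masses[b]) <= tol]


def split_into_batches(masses, tol):
    """Greedily assign residues to batches so that no two residues in the
    same batch are within `tol` Da of each other (assumed criterion)."""
    batches = []
    for res in sorted(masses, key=masses.get):
        for batch in batches:
            if all(abs(masses[res] - masses[other]) > tol for other in batch):
                batch.append(res)
                break
        else:
            batches.append([res])
    return batches


tight = 0.05   # assumed tolerance: only near-isobaric residues collide
print(close_pairs(RESIDUE_MASS, tight))
print(split_into_batches(RESIDUE_MASS, tight))
```

A larger tolerance (for example about 2 Da, to keep isotope envelopes apart) yields more batches; the appropriate value depends on the resolution of the instrument and is not specified here.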

The results of the in vitro enzymatic cleavage assay of five peptide batches (namely, PS02-1,
PS02-2, PS02-3, PS06-I, and PS06-L) are shown in Figure 2 and summarized in Table 3. For the
PS02-1 peptide batch, only the original peptide with Leu in the P2 position (m/z value of
1739.03) was cleaved, with its corresponding peptide peak absent from the mass spectrum after the
enzymatic assay, whereas the other three peptides with Pro, Cys, and Glu substitutions for Leu in
the P2 position were not cleaved, with their corresponding peaks remaining. For the PS02-2 and
PS02-3 peptide batches, none of the peptides were cleaved, with their corresponding peaks
remaining. For the PS06-I and PS06-L peptide batches, the peptides with Ile substitution (m/z
value of 1795.09) and Leu substitution (m/z value of 1795.09) for Gly in the P2’ position,
respectively, were both not cleaved, with their corresponding peaks remaining.


Molecular docking of SARS-CoV 3CLpro
and peptide substrates


Based on the mass spectrometry results from the in vitro enzymatic cleavage assay (Table 2), four
target positions (P2, P3, P1’, and P2’) located at the SARS-CoV 3CLpro cleavage sites were
further analyzed on the Insight II molecular modeling platform, and the molecular docking results
are shown in Figures 3 and 4. The control peptide, PS01, was docked to the predefined active site
of SARS-CoV 3CLpro, with the conserved Gln residue located in close proximity to the Cys145-His41
residues at the catalytic dyad of SARS-CoV 3CLpro (Fig. 4). The total energy generated from
energy minimization (182.74 kcal/mol) was used as the reference for comparing the docking results
of the other peptide substrates. This slightly positive total energy value is due to the
relatively high repulsive van der Waals energy arising from the narrow binding pocket on SARS-CoV
3CLpro.

For the PS02 peptide with Phe substitution in the P2 position, the total energy of 186.98
kcal/mol was similar to that of the PS01 control peptide (182.74 kcal/mol), which further
confirmed the tolerance of Phe substitution at the P2 position at the cleavage site (Fig. 4). On
the other hand, the total docking energies for the PS03 peptide with Pro substitution in the P3
position (1028.33 kcal/mol); the PS05 peptides with Pro substitution (1385.02 kcal/mol), Asp
substitution (1384.69 kcal/mol), and Glu substitution (1384.91 kcal/mol) in the P1’ position; and
the PS06 peptides with Pro substitution (1383.25 kcal/mol), Ile substitution (1381.35 kcal/mol),
and Leu substitution (1387.60 kcal/mol) in the P2’ position were at least fivefold higher than
the total energy of the PS01 control peptide (182.74 kcal/mol). The relatively high total docking
energy revealed the unfavorable cleavage reactions of these peptides by SARS-CoV 3CLpro (Fig. 4).
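
The "at least fivefold" comparison can be checked directly from the energies quoted above. The short sketch below simply divides each reported total docking energy by the PS01 reference value; the dictionary keys are illustrative labels, and the ratio is only a descriptive summary, not a quantity defined by the docking software.

```python
CONTROL = 182.74  # total energy of the PS01 control peptide (kcal/mol)

# Total docking energies (kcal/mol) as reported in the text.
energies = {
    "P2 Phe": 186.98,
    "P3 Pro": 1028.33,
    "P1' Pro": 1385.02,
    "P1' Asp": 1384.69,
    "P1' Glu": 1384.91,
    "P2' Pro": 1383.25,
    "P2' Ile": 1381.35,
    "P2' Leu": 1387.60,
}

# Ratio of each substituted peptide's energy to the control energy.
fold = {label: energy / CONTROL for label, energy in energies.items()}

for label, ratio in fold.items():
    print(f"{label:8s} {energies[label]:8.2f} kcal/mol  {ratio:5.2f}x control")
```

The Phe substitution at P2 stays within a few percent of the control, whereas every substitution associated with an uncleaved peptide exceeds five times the control energy.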


Discussion 


We have introduced a rapid and high-throughput screening method to generate a comprehensive
overview of the substrate specificity preferences of SARS-CoV 3CLpro via a combinatory approach
that uses MALDI-TOF MS and the novel “cartridge replacement” solid-phase peptide synthesis
strategy. Research on the substrate specificity preferences of SARS-CoV 3CLpro is gaining
increasing attention, owing to the biological significance of SARS-CoV 3CLpro in the viral life
cycle and the feasibility of designing specific SARS-CoV 3CLpro inhibitors as potential anti-SARS
drugs. It has been explicitly demonstrated that SARS-CoV 3CLpro cleaves the coronavirus replicase
polyproteins at no fewer than 11 conserved sites, preferentially involving the Leu-Gln↓ sequence
in the P2 and P1 positions at the cleavage site, respectively (Ziebuhr et al. 2000; Hegyi and
Ziebuhr 2002; Anand et al. 2003). To further extend our knowledge of the substrate specificity of
SARS-CoV 3CLpro, we aimed to comprehensively study the biological importance of amino acid
residues in six selected positions (namely P2, P3, P4, P1’, P2’, and P3’) flanking the conserved
P1 position at the SARS-CoV 3CLpro cleavage site by a rapid and high-throughput approach
(Table 1).


Figure 2. Secondary screening: Mass spectrometry analysis of SARS-CoV 3CLpro cleavage products. The monoisotopic peaks
of each peptide can be clearly shown. (A) PS02-1 peptide batch. The four peaks of peptides with Pro (P; m/z = 1722.90), Cys (C;
m/z = 1728.86), Leu (L; m/z = 1739.03), and Glu (E; m/z = 1754.89) substitutions were resolved on the mass spectrum before
cleavage assay (A-1); only the peptide peak corresponding to the Leu (*) substitution was absent, whereas all other peaks
remained after the reaction (A-2). (B) PS02-2 peptide batch. The three peaks of peptides with Val (V; m/z = 1724.92), Asn (N;
m/z = 1739.89), and Lys (K; m/z = 1753.95) substitutions were resolved on the mass spectrum before cleavage assay (B-1), which
all remained after the reaction (B-2). (C) PS02-3 peptide batch. The four peaks of peptides with Thr (T; m/z = 1726.90), Ile
(I; m/z = 1739.03), Asp (D; m/z = 1740.88), and Gln (Q; m/z = 1753.91) substitutions were resolved on the mass spectrum before
cleavage assay (C-1), which all remained after the reaction (C-2). The 1740.88 monoisotopic peak with high intensity (*)
belongs to the mother peak of the peptide with Asp (D) substitution. (D) PS06-I peptide batch. The peak of the peptide with Ile (I;
m/z = 1795.09) substitution was resolved on the mass spectrum before cleavage assay (D-1), which remained after the reaction
(D-2). (E) PS06-L peptide batch. The peak of the peptide with Leu (L; m/z = 1795.09) substitution was resolved on the mass
spectrum before cleavage assay (E-1), which remained after the reaction (E-2).

From the primary screening procedure, the results of the PS03, PS04, PS05, and PS07 peptide
substrates can be interpreted precisely. From the results of the PS04 and PS07 peptide substrates
(Fig. 1), all 20 peptide peaks were absent from the mass spectra after the SARS-CoV 3CLpro
cleavage reaction: SARS-CoV 3CLpro can cleave all of the 20 peptides regardless of the types of
amino acid substitutions at the P3’ or P4 positions. For the PS03 and PS05 peptide substrates,
the corresponding peptide peaks (m/z value of 1737.01 for PS03 and m/z values of 1749.05,
1767.02, and 1781.04 for PS05) retained in the mass spectra could be clearly resolved after the
cleavage reaction (Fig. 1); SARS-CoV 3CLpro cannot cleave those peptides, and because their m/z
values do not overlap with those of other peptides, the uncleaved peptides can be easily
identified.


Table 3. Summary of results from secondary screening

Peptide   Amino acid                               Cleaved    Uncleaved
batch     substitution                             peptides   peptides
PS02-1    Pro (a), Cys (b), Leu (c), Glu (d)       Leu        Pro, Cys, Glu
PS02-2    Val (a), Asn (c, e), Lys (d)             -          Val, Asn, Lys
PS02-3    Thr (b), Ile (c), Asp (e), Gln (d)       -          Thr, Ile, Asp, Gln
PS06-I    Ile (f)                                  -          Ile
PS06-L    Leu (f)                                  -          Leu

(a) Pro (m/z value of 1722.90) and Val (m/z value of 1724.92) substitutions with close masses were
separated into PS02-1 and PS02-2 peptide batches, respectively.
(b) Cys (m/z value of 1728.86) and Thr (m/z value of 1726.90) substitutions with close masses were
separated into PS02-1 and PS02-3 peptide batches, respectively.
(c) Leu (m/z value of 1739.03), Asn (m/z value of 1739.89), and Ile (m/z value of 1739.03)
substitutions with close masses were separated into PS02-1, PS02-2, and PS02-3 peptide batches,
respectively.
(d) Glu (m/z value of 1754.89), Lys (m/z value of 1753.95), and Gln (m/z value of 1753.91)
substitutions with close masses were separated into PS02-1, PS02-2, and PS02-3 peptide batches,
respectively.
(e) Asn (m/z value of 1739.89) and Asp (m/z value of 1740.88) substitutions with close masses were
separated into PS02-2 and PS02-3 peptide batches, respectively.
(f) Ile and Leu substitutions with identical masses (m/z value of 1795.09) were separated into
PS06-I and PS06-L peptide batches, respectively.

For PS02 peptide substrates, only two peptide peaks vanished after the cleavage reaction.
Although these two peaks can be identified by their exact m/z values, several remaining peptide
peaks with close masses introduced ambiguity in the mass spectra. In addition, for PS06 peptide
substrates, only two peptide peaks were found on the mass spectra after the cleavage reaction.
One of these peaks (m/z value of 1795.09) may contain two peptide entities with identical m/z
values, i.e., the Ile- and Leu-containing peptides. In fact, the same ambiguity was also observed
in one of the cleaved peaks of the PS02 peptide substrates (m/z value of 1739.03). Therefore, a
secondary screening procedure using the defined batch synthesis approach (Table 3) was used to
solve the above problem.

In addition to the well-known conserved Gln residue in the P1 position at the SARS-CoV 3CLpro
cleavage site, the P2 position, which is exclusively occupied by a Leu residue, also serves as
another important determinant of substrate specificity. Our peptide cleavage results indicated
that only the peptide substrate with Phe replacement in the P2 position was also favorable for
SARS-CoV 3CLpro cleavage (Fig. 1; Table 2), with a similar total energy level (186.98 kcal/mol)
in molecular docking when compared with the total energy of the positive control peptide (182.74
kcal/mol) with Leu in the P2 position. Also, the results of the three peptide batches (PS02-1,
PS02-2, and PS02-3) in the secondary screening procedure further confirmed the results obtained
from the primary screening procedure. The PS02-3 peptide batch showed that Ile was not tolerated
in the P2 position by SARS-CoV 3CLpro. Our results demonstrated consistency with other findings
(Fan et al. 2004, 2005), suggesting that the P2 position serves as an important hydrophobic
pocket for SARS-CoV 3CLpro-specific binding; thus, the peptide substrate with replacement of the
original Leu with the hydrophobic Phe in the P2 position was still susceptible to SARS-CoV 3CLpro
cleavage (Fig. 4). From the results of a previous time course cleavage experiment (data not
shown), we observed that the SARS-CoV 3CLpro proteolytic cleavage rate of the positive control
peptide was higher than that of the peptide with Phe substitution in the P2 position, which
accounted for the dominant occurrence of Leu in the P2 position. Other studies also demonstrated
that SARS-CoV 3CLpro favorably tolerates Phe and Val substitutions and, to a lesser extent, Met
and Ile substitutions in the P2 position (Fan et al. 2004, 2005), hence revealing the
significance of the P2 hydrophobic pocket in the determination of SARS-CoV 3CLpro substrate
specificity.


Figure 3. Interaction between SARS-CoV 3CLpro and PS01 control peptide as predicted by molecular docking. Overview (left)
of the interaction between the SARS-CoV 3CLpro (purple) and peptide substrate PS01 (red) and the zoom-in view (right) of the
conserved Gln residue (gray) in the P1 position at the cleavage site of the PS01 control peptide being docked into the catalytic
dyad of SARS-CoV 3CLpro, which is composed of the Cys145 (orange) and His41 (yellow) residues.

Figure 4. Comparison between the molecular models of different selected peptide substrates docked
to the active site of SARS-CoV 3CLpro. The Cys145 (orange) and His41 (yellow) residues in the
catalytic dyad of SARS-CoV 3CLpro (purple) and the conserved Gln residue (gray) in the P1
position at the cleavage site of the peptide substrate (red) are shown. (A) PS02 peptide with Phe
(green) in the P2 position was in close proximity to the active site of SARS-CoV 3CLpro. Peptide
substrates that were unfavorable for SARS-CoV 3CLpro cleavage deviated away from the active site
of SARS-CoV 3CLpro; these include the PS03 peptide with Pro (blue) in the P3 position (B); the
PS05 peptide with Pro (blue) in the P1’ position (C); the PS05 peptide with Asp (blue) in the P1’
position that was in close proximity to the acidic residues (brown) at the active site (D); the
PS05 peptide with Glu (blue) in the P1’ position that was in close proximity to the acidic
residues (brown) at the active site (E); the PS06 peptide with Pro (blue) in the P2’ position
(F); the PS06 peptide with Ile (blue) in the P2’ position (G); and the PS06 peptide with Leu
(blue) in the P2’ position (H).
Furthermore, the P1’ position, which is frequently occupied by a Ser residue, also contributes
considerably to the substrate specificity of SARS-CoV 3CLpro. Our peptide cleavage results
revealed that the P1’ position is highly unfavorable to substitution by Pro, Asp, and Glu
residues (Fig. 1; Table 2), which is consistent with molecular docking results showing their
total energy levels being higher than that of the positive control peptide by more than fivefold.
Our findings on the intolerance of Asp or Glu substitutions in the P1’ position suggest that the
acidic residues at the binding interface of the SARS-CoV 3CLpro active site are probably in close
proximity and sterically complementary to the P1’ position of the substrate, leading to
electrostatic repulsions between the negatively charged acidic residues that hinder the substrate
binding to SARS-CoV 3CLpro. Our molecular docking results also demonstrated that the P1’ position
is proximal to the Glu47 and Asp48 residues at the SARS-CoV 3CLpro active site (Fig. 4), which
further confirmed our interpretations. Also, the unfavorable substitution by Pro in the P1’
position could be accounted for by the steric hindrance arising from its cyclic side chain
(Fig. 4), which obstructs the substrate binding to SARS-CoV 3CLpro. Previous studies also
demonstrated that small aliphatic residues such as Ser, Ala, and Gly are favored in the P1’
position (Fan et al. 2005).

Moreover, our results demonstrated that the substrate specificity of SARS-CoV 3CLpro is less
dependent on the P2’ and P3 positions at the cleavage site. The substitutions of Pro, Ile, or Leu
in the P2’ position and the substitution of Pro in the P3 position in the peptide substrates are
unfavorable for SARS-CoV 3CLpro cleavage (Fig. 1; Table 2), in which the bulky Pro residue and
the β-branching Ile and Leu residues may cause steric hindrance to binding of SARS-CoV 3CLpro
with its substrates (Fig. 4). The results of the PS06-I and PS06-L peptide batches further
confirmed the results of the primary screening: Neither Ile nor Leu is tolerated in the P2’
position by SARS-CoV 3CLpro. On the other hand, the peptide cleavage results showed that the P3’
and P4 positions have no effect on determining the substrate specificity preferences of SARS-CoV
3CLpro (Fig. 1; Table 2).

Rather than providing quantitative kinetic measurements of
the interaction between SARS-CoV 3CLpro and its substrates,
our study focuses on describing a useful tool for rapid and
comprehensive screening of the substrate specificity of
SARS-CoV 3CLpro, which is also applicable to the study of
other proteases. By combining MALDI-TOF MS with a synthetic
peptide-based approach, peptide-cleavage studies on a defined
equimolar mixture of a batch of 20 peptide substrates could
be performed before and after the protease cleavage reaction
in a single MALDI-TOF MS analysis. The MALDI-TOF mass
spectrometer is a delicate and sensitive instrument that
enables clear and precise discrimination of different
peptides, even those differing by a single amino acid.
Each of the 20 peptide substrates in the mixture can
therefore be monitored simultaneously.
Our approach is advantageous over traditional methods, which
are restricted to testing single peptides under defined
conditions one at a time and therefore require 20 identical
experimental setups for every analysis. After generating the
comprehensive cleavage profile of the 20 peptides
simultaneously in the primary screening procedure of the
“cartridge replacement” strategy, we can easily identify
some of the biologically important residues taking part in
the recognition process. Nevertheless, we employed a
secondary screening procedure, using the defined
batch-synthesis approach to synthesize target peptides in
different batches, for exact identification of ambiguous
residues such as Ile/Leu and Gln/Lys, which have identical
or nearly identical masses. This secondary screening provides
an option for cases in which the resolution of the MALDI-TOF
mass spectrometer is not sufficient to distinguish between
species with a 1-Da mass difference. Also, MALDI-TOF MS
analysis is tolerant to impurities; therefore, the
reaction mixture can be analyzed directly after a simple
desalting procedure, which reduces the workload of
downstream purification of cleavage products prior
to analysis.
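
The need for a secondary screen can be illustrated with a minimal Python sketch that lists amino acid pairs whose residue masses fall within a chosen tolerance. The table holds standard monoisotopic residue masses; the function name and the tolerance values are illustrative assumptions and are not part of the original analysis.

```python
from itertools import combinations

# Standard monoisotopic residue masses (Da), one-letter codes.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}

def close_pairs(tolerance_da):
    """Return (residue, residue, mass difference) for pairs within tolerance."""
    pairs = []
    for a, b in combinations(sorted(RESIDUE_MASS), 2):
        diff = abs(RESIDUE_MASS[a] - RESIDUE_MASS[b])
        if diff <= tolerance_da:
            pairs.append((a, b, round(diff, 4)))
    return pairs

# Pairs that are isobaric or nearly so (hypothetical 0.1 Da tolerance).
for pair in close_pairs(0.1):
    print(pair)
```

Because the peptides in one pool differ only at the substituted position, these residue-mass differences equal the differences between the corresponding peptide masses. Widening the tolerance to about 1 Da also brings in pairs such as Asn/Asp and Glu/Lys.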

For the synthesis of peptide substrates, we applied the
novel “cartridge replacement” strategy, which exploits the
flexibility of the solid-phase peptide synthesis process:
each amino acid cartridge can be filled with different
amino acids in defined ratios to generate the desired
combinations of mixed peptide substrates according to
experimental needs. By combining MALDI-TOF MS analysis
with the “cartridge replacement” peptide synthesis
approach, we present a useful tool that is widely
applicable to studies that involve the monitoring of
protease cleavage activity, including the rapid screening
of potential protease inhibitors of other coronaviruses
and the human immunodeficiency virus (HIV). On the
basis of our studies, future experiments will be carried out
to address the effectiveness of various synthetic
peptide-based inhibitors in blocking the protease activity,
with the ultimate aim of anti-SARS drug development.

In this study, we have combined the sensitivity and high
resolution of MALDI-TOF MS with the flexibility of the
“cartridge replacement” solid-phase peptide synthesis
approach to generate SARS-CoV 3CLpro proteolytic cleavage
data rapidly and comprehensively. Our studies
provide insights into the full automation of such
experimental procedures to meet the increasing demand for
high-throughput and automated proteomic studies.


Materials and methods 


Protein expression and purification 


The coding sequence of SARS-CoV 3CLpro used for cloning
was based on the SARS-CoV BJ01 strain complete genome
sequence downloaded from GenBank (accession no.
AY278488; http://www.ncbi.nlm.nih.gov/entrez). The
construction of the pGEX-3CLpro plasmid has been described
previously by Dr. Rao’s team at Tsinghua University (Yang et
al. 2003). The pGEX-3CLpro plasmid was transformed into
Escherichia coli strain BL21(DE3)-competent cells and plated
on Luria-Bertani (LB) agar containing ampicillin (100 µg/mL)/
chloramphenicol (25 µg/mL) and grown overnight at 37°C. A
single colony was inoculated into 50 mL of LB-broth
containing ampicillin (100 µg/mL) and grown overnight at
37°C with shaking. This overnight culture was then
transferred into 5 L of LB-broth containing ampicillin
(100 µg/mL) for large-scale protein expression at 37°C with
shaking. When the culture had grown to an OD600nm of 0.6, it
was induced with 0.1 mM
isopropyl-1-thio-β-D-galactopyranoside (IPTG) and grown
for an additional 6 h at 25°C. Cells were harvested by 20-min
centrifugation at 3200g at 4°C, and the bacterial cell pellet was
resuspended in lysis buffer containing 10 mM Tris-HCl (pH
7.5), 500 mM NaCl, 1 mM dithiothreitol (DTT), 0.5 mM
EDTA, and 0.2 mM phenylmethylsulfonyl fluoride (PMSF),
and homogenized by sonication. The lysate was centrifuged at
48,000g for 45 min at 4°C, and the supernatant was loaded
onto a Glutathione Sepharose 4B affinity column (Amersham
Biosciences) equilibrated with the lysis buffer. The column was
then washed with 10 column volumes of lysis buffer. To cleave
the GST-tag from the GST-3CLpro fusion protein, PreScission
Protease (Amersham Biosciences) was added to the column,
and the mixture was incubated overnight at 4°C. The
SARS-CoV 3CLpro recombinant protein was eluted with the lysis
buffer, concentrated to 10 mg/mL with Microcon Centrifugal
Filter Devices (Millipore), and then stored at −80°C.


“Cartridge replacement” solid-phase peptide synthesis


The sequence of the peptide substrate used in this study was
based on one of the SARS-CoV 3CLpro cleavage sites on the
SARS-CoV BJ01 polyprotein pp1ab (residues 3232-3247) (Table 1).
The biological significance of the amino acid residues in the
P2, P3, P4, P1’, P2’, and P3’ positions flanking the conserved
Gln residue in the P1 position at the cleavage site was
investigated one at a time using the novel “cartridge
replacement” solid-phase peptide synthesis strategy. In a
standard procedure of solid-phase peptide synthesis, each
amino acid cartridge, which contains a particular type of
amino acid, is placed side-by-side in the peptide synthesizer
according to the target peptide sequence, and the peptide
synthesis begins with the cartridge containing the first
amino acid at the carboxyl terminus of the sequence. In this
study, we exploited the flexibility in changing the contents
of each amino acid cartridge and designed a novel peptide
synthesis strategy that we named “cartridge replacement”
solid-phase peptide synthesis.


Primary screening 

Prior to peptide synthesis, the corresponding amino acid
cartridge at the target position under investigation was
replaced by another cartridge containing a mixture of 20
standard amino acids in equal molar ratio, such that a mixed pool
of 20 synthetic peptides differing by a single amino acid residue
was generated as the end product after complete synthesis of
the substrate sequence. Six sets of such peptide substrate pools
together with the original peptide substrate were synthesized
with this approach (Table 1) using a standard solid-phase
peptide synthesis protocol as described previously (Choy et al.
2004).
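
The composition of one primary-screening pool can be sketched in a few lines of Python. The 12-residue template below is an illustrative placeholder chosen only so that P1 is Gln, P2 is Leu, and P2’ is Gly, as stated in the text; it is not the 16-residue sequence of Table 1, and the function and variable names are likewise assumptions.

```python
# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def substitution_pool(template, index):
    """Return the 20 peptides obtained by varying one position of the template."""
    return [template[:index] + aa + template[index + 1:] for aa in AMINO_ACIDS]

# Hypothetical template (NOT the Table 1 sequence); cleavage occurs after Gln.
TEMPLATE = "TSAVLQSGFRKM"
P1_INDEX = TEMPLATE.index("Q")   # position of the conserved P1 Gln
P2_INDEX = P1_INDEX - 1          # P2 lies one residue toward the N terminus

pool = substitution_pool(TEMPLATE, P2_INDEX)
print(len(pool), "peptides, e.g.", pool[:3])
```

Replacing `P2_INDEX` with the index of another target position gives the corresponding pool for that position; the original peptide is always one member of each pool.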


Secondary screening 

The PS02 and PS06 peptide substrates required further
analysis by a secondary screening. For the PS02 peptide
substrates, the peptides with close masses were separated into
three batches: PS02-1, PS02-2, and PS02-3. For the peptide
synthesis of the PS02-1 batch, the corresponding Leu cartridge at
the P2 position was replaced by a cartridge containing a
mixture of Cys, Glu, Leu, and Pro in an equal molar ratio. For the
peptide synthesis of the PS02-2 batch, the corresponding Leu
cartridge at the P2 position was replaced by a cartridge
containing a mixture of Asn, Lys, and Val in an equal molar ratio.
For the peptide synthesis of the PS02-3 batch, the
corresponding Leu cartridge at the P2 position was replaced by a
cartridge containing a mixture of Asp, Gln, Ile, and Thr in an
equal molar ratio. For the PS06 peptide substrates, only the Ile
and Leu substitutions were further investigated and separated
into two batches: PS06-I and PS06-L. For the peptide synthesis of
the PS06-I and PS06-L batches, the corresponding Gly cartridge at
the P2’ position was replaced by a cartridge containing Ile or
Leu, respectively.
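
As a quick check of the batch design described above, the following Python sketch computes the smallest residue-mass gap within each PS02 batch. The masses are standard monoisotopic residue masses; the helper name `min_gap` is an illustrative assumption.

```python
from itertools import combinations

# Standard monoisotopic residue masses (Da) for the residues in the PS02 batches.
MASS = {
    "Pro": 97.05276, "Val": 99.06841, "Thr": 101.04768, "Cys": 103.00919,
    "Leu": 113.08406, "Ile": 113.08406, "Asn": 114.04293, "Asp": 115.02694,
    "Gln": 128.05858, "Lys": 128.09496, "Glu": 129.04259,
}

# Batch compositions as given in the text.
BATCHES = {
    "PS02-1": ["Cys", "Glu", "Leu", "Pro"],
    "PS02-2": ["Asn", "Lys", "Val"],
    "PS02-3": ["Asp", "Gln", "Ile", "Thr"],
}

def min_gap(residues):
    """Smallest pairwise mass difference (Da) among the given residues."""
    return min(abs(MASS[a] - MASS[b]) for a, b in combinations(residues, 2))

for name, residues in BATCHES.items():
    print(name, round(min_gap(residues), 3))
```

Within each batch the closest pair is separated by nearly 2 Da or more, whereas pooling all eleven residues would place Ile and Leu at the same mass and Gln and Lys within about 0.04 Da of each other.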

All of the peptide substrates were synthesized by the
solid-phase technique with an in-house Applied Biosystems 433A
Peptide Synthesizer on preloaded resins using standard
9-fluorenylmethoxycarbonyl (Fmoc) synthesis chemistry with
piperidine deprotection and
2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium
hexafluorophosphate (HBTU)/N-hydroxybenzotriazole (HOBt)
activation. The amino (NH2) termini
of peptides were acetylated on the resin with acetic anhydride
capping solution containing 0.5 M acetic anhydride, 0.125 M
N,N-diisopropylethylamine (DIEA), and 0.015 M
N-hydroxybenzotriazole (HOBt) in N-methylpyrrolidone (NMP). The
acetylated peptides were cleaved from the resins, and side-chain
protecting groups were removed by cleavage solution
containing 25 g/L ethanedithiol (EDT) and 50 g/L thioanisole in
trifluoroacetic acid (TFA). The cleaved peptides were then
precipitated with cold diethyl ether, washed, and lyophilized.
The identities of the synthetic peptides were confirmed by mass
spectrometry on an Ettan MALDI-TOF Pro Mass Spectrometer
(Amersham Biosciences).


Peptide cleavage assay and MS analysis 


The in vitro peptide cleavage assay was performed in a total
reaction volume of 25 µL (pH 7.6), containing 1.5 µM
recombinant SARS-CoV 3CLpro, 0.5 mM sum total concentration of
the peptide substrate batch, 20 mM Na2HPO4, 200 mM NaCl,
1 mM EDTA, and 1 mM DTT, with overnight incubation at
25°C. Cleavage products were desalted by ZipTipµ-C18
(Millipore), and the corresponding peptide peaks before and after
the cleavage reactions were resolved by mass spectrometry on
an Ettan MALDI-TOF Pro Mass Spectrometer (Amersham
Biosciences) or an ABI 4700 MALDI TOF/TOF Analyzer
(Applied Biosystems). For acquisition of mass spectra,
0.5-µL aliquots of samples were dispensed onto the MALDI
plate, followed by 0.5 µL of matrix solution (5 g/L
α-cyano-4-hydroxycinnamic acid in 50% acetonitrile/0.1%
trifluoroacetic acid). All spectra were acquired in
reflectron-positive ion mode and accumulated from 1000 laser
shots with an acceleration voltage of 20 kV and pulsed
extraction. Angiotensin III (m/z value of
897.523) and adrenocorticotropic hormone (m/z value of
2465.199) were used as standards for internal calibration of
MALDI-TOF MS spectra.
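
The reaction composition above implies the following per-peptide concentration and enzyme-to-substrate ratio, worked out here as a small Python sketch. The default of 20 peptides per batch applies to the primary-screening pools only (secondary batches contain fewer peptides), and the function name and return keys are illustrative assumptions.

```python
def assay_amounts(volume_ul=25.0, enzyme_um=1.5,
                  substrate_total_mm=0.5, n_peptides=20):
    """Derive molar amounts for one cleavage reaction from the stated conditions."""
    substrate_total_um = substrate_total_mm * 1000.0   # mM -> µM
    return {
        # equimolar pool: each peptide gets an equal share of the total
        "per_peptide_um": substrate_total_um / n_peptides,
        # µM x µL = pmol
        "enzyme_pmol": enzyme_um * volume_ul,
        "substrate_pmol": substrate_total_um * volume_ul,
        # molar excess of total substrate over enzyme
        "substrate_per_enzyme": substrate_total_um / enzyme_um,
    }

for key, value in assay_amounts().items():
    print(key, round(value, 2))
```

For a 20-peptide pool this gives 25 µM of each peptide and roughly a 333-fold molar excess of total substrate over enzyme.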


Molecular docking 


All three-dimensional molecular modeling studies were
performed on an Insight II molecular modeling platform (Accelrys
Software Inc.) run on Silicon Graphics Octane2 workstations
(Silicon Graphics Inc.). The consistent valence force field
(CVFF) was selected in all simulations. The X-ray crystal
structure of the SARS-CoV 3CLpro (1UK4) was downloaded
from the Protein Data Bank (PDB; http://www.rcsb.org/pdb)
and used as a starting model. The catalytic active site of chain
A in the SARS-CoV 3CLpro structure without the bound CMK
peptide was used in the study. Hydrogen atoms were added
to the structure of SARS-CoV 3CLpro chain A using the
BUILDER module. Then, the force field potentials and partial
charges were assigned. The resulting structure was
subsequently subjected to energy minimization using the
DISCOVER module and was used as the initial structure for
molecular docking. Since the proteolytic activity of SARS-CoV
3CLpro mainly relies on the active nucleophile Cys145 and the
acid-base catalyst His41, the region containing all residues
within a 10.0 Å radius of this Cys-His dyad was selected as the
target site and was used to set up a grid with the DOCKING
module. The nonbond energies of this region were
precalculated on the grid. Different peptide substrates were then
docked into the target site by manual docking using the
DOCKING module. The best docking position was chosen on the
basis of the docking energy of the peptide/SARS-CoV 3CLpro
complex. Thousands of peptide orientations were searched by
manual maneuvering in the Cys-His dyad region of SARS-CoV
3CLpro until the energy minimum was reached. The resultant
complex was subjected to energy minimization and molecular
dynamics simulation using the DISCOVER module. A
dielectric constant of 1.0 was used throughout the energy
minimization and molecular dynamics simulation. The system was
equilibrated at 300 K for 100 psec, during which time the
potential energy of the system reached a stable value. A time
step of 1 fsec was used to integrate the equation of motion. The
final conformation of the structure was obtained through 5000
iterations of steepest descent energy minimization. Ultimately,
the total docking energy between the peptide substrate and
SARS-CoV 3CLpro in the energy-minimized complex was
calculated using the DOCKING module. The docking energy
included the van der Waals and electrostatic energies, which
were the components of the intermolecular energy.
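
To make the two components of the docking energy concrete, the following Python sketch sums a 12-6 van der Waals term and a Coulomb term over peptide-protein atom pairs. It is a generic illustration and not the Insight II DOCKING implementation: the atom layout, the geometric-mean combination of the van der Waals coefficients, and the 4 Å example distance are all assumptions.

```python
import math

# Coulomb conversion factor in kcal*Angstrom/(mol*e^2).
COULOMB_CONSTANT = 332.0637

def coulomb_energy(q1, q2, r, dielectric=1.0):
    """Coulomb energy (kcal/mol) of two point charges r Angstrom apart."""
    return COULOMB_CONSTANT * q1 * q2 / (dielectric * r)

def lennard_jones_energy(a, b, r):
    """12-6 van der Waals energy with repulsive (a) and attractive (b) coefficients."""
    return a / r**12 - b / r**6

def intermolecular_energy(peptide, protein, dielectric=1.0):
    """Return (van der Waals, electrostatic, total) over all peptide-protein pairs.

    Each atom is a dict with keys 'xyz', 'q', 'A', 'B' (hypothetical layout).
    """
    vdw = elec = 0.0
    for p in peptide:
        for s in protein:
            r = math.dist(p["xyz"], s["xyz"])
            # geometric-mean pair coefficients: an assumption of this sketch
            vdw += lennard_jones_energy(math.sqrt(p["A"] * s["A"]),
                                        math.sqrt(p["B"] * s["B"]), r)
            elec += coulomb_energy(p["q"], s["q"], r, dielectric)
    return vdw, elec, vdw + elec

def md_steps(duration_psec, time_step_fsec):
    """Number of integration steps for a given run length and time step."""
    return round(duration_psec * 1000.0 / time_step_fsec)

# Two like charges of -1 at a hypothetical 4 Angstrom separation, dielectric 1.0.
print(round(coulomb_energy(-1.0, -1.0, 4.0), 2), "kcal/mol")
print(md_steps(100, 1), "steps for 100 psec at 1 fsec")
```

The positive Coulomb term for two negative charges illustrates, under these simplified assumptions, why an acidic residue in the P1’ position placed near Glu47 and Asp48 would raise the docking energy.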